Automatic Language Identification of Telephone Speech

Author

Marc A. Zissman

Abstract

Lincoln Laboratory has investigated the development of a system that can automatically identify the language of a speech utterance. To perform the task of automatic language identification, we have experimented with four approaches: Gaussian mixture model classification; single-language phone recognition followed by language modeling (PRLM); parallel PRLM, which uses multiple single-language phone recognizers, each trained in a different language; and language-dependent parallel phone recognition. These four approaches, which span a wide range of training requirements and levels of recognition complexity, were evaluated with the Oregon Graduate Institute Multi-Language Telephone Speech Corpus. Our results show that the three systems with phone recognizers achieved higher performance than the simpler Gaussian mixture classifier. The top-performing system was parallel PRLM, which performed two-language, closed-set, forced-choice classification with a 2% error rate for 45-sec utterances and a 5% error rate for 10-sec utterances. For eleven-language classification, parallel PRLM exhibited an 11% error rate for 45-sec utterances and a 21% error rate for 10-sec utterances.
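The language-modeling half of PRLM can be illustrated with a minimal sketch. It assumes a phone recognizer has already converted each utterance into a sequence of phone symbols, and it scores that sequence against one interpolated bigram model per candidate language. The function names, the toy phone strings, and the interpolation weights are all hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math
from collections import Counter


def train_bigram_model(phone_sequences):
    """Collect unigram and bigram counts from phone sequences of one language."""
    unigrams, bigrams, contexts = Counter(), Counter(), Counter()
    for seq in phone_sequences:
        unigrams.update(seq)
        for prev, cur in zip(seq, seq[1:]):
            bigrams[(prev, cur)] += 1
            contexts[prev] += 1
    return {
        "unigrams": unigrams,
        "bigrams": bigrams,
        "contexts": contexts,
        "total": sum(unigrams.values()),
    }


def log_likelihood(model, seq, vocab_size, weights=(0.6, 0.3, 0.1)):
    """Log-probability of seq under an interpolated bigram model.

    The weights (bigram, unigram, uniform) are assumed values, not the paper's.
    The uniform term keeps every probability strictly positive.
    """
    w_bi, w_uni, w_flat = weights
    score = 0.0
    for prev, cur in zip(seq, seq[1:]):
        ctx = model["contexts"][prev]
        p_bi = model["bigrams"][(prev, cur)] / ctx if ctx else 0.0
        p_uni = model["unigrams"][cur] / model["total"] if model["total"] else 0.0
        score += math.log(w_bi * p_bi + w_uni * p_uni + w_flat / vocab_size)
    return score


def identify_language(models, seq, vocab_size):
    """Closed-set, forced-choice decision: pick the best-scoring language."""
    return max(models, key=lambda lang: log_likelihood(models[lang], seq, vocab_size))


# Toy phone strings standing in for recognizer output (hypothetical data).
toy_training = {
    "lang_a": [["a", "b", "a", "b", "a", "b"]],
    "lang_b": [["a", "a", "b", "b", "a", "a", "b", "b"]],
}
toy_models = {lang: train_bigram_model(seqs) for lang, seqs in toy_training.items()}
```

Calling `identify_language(toy_models, ["a", "b", "a", "b"], vocab_size=2)` selects the language whose phone-sequence statistics best match the input. A parallel PRLM variant would, under the same assumptions, run several such front ends (one per phone recognizer) and combine their per-language scores before making the decision.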


Similar resources

Automatic language identification using large vocabulary continuous speech recognition

We have developed a highly accurate automatic language identification system based on large vocabulary continuous speech recognition (LVCSR). Each test utterance is recognized in a number of languages, and the language ID decision is based on the probability of the output word sequence reported by each recognizer. Recognizers were implemented for this test in English, Japanese, and Spanish, usi...


Phonetic Landmark Detection for Automatic Language Identification

This paper presents a method of augmenting shifted-delta cepstral coefficients (SDCCs) with the classification outputs of an array of support vector machines (SVMs) trained to detect a set of manner and place features on telephone speech. The SVM array allows for broad phoneme classification, and when this information is concatenated with SDCCs to form a hybrid feature vector for each acoustic ...


Perceptual benchmarks for automatic language identification

There has been renewed interest in the field of automatic language identification over the past two years. The advent of a public-domain ten-language corpus of telephone speech has made the evaluation of different approaches to automatic language identification feasible. In an effort to provide benchmarks for evaluating machine performance, we conducted perceptual experiments on 1-, 2-, 4- and 6-secon...


Language identification using acoustic log-likelihoods of syllable-like units

Automatic spoken language identification (LID) is the task of identifying the language from a short utterance of the speech signal uttered by an unknown speaker. The most successful approach to LID uses phone recognizers of several languages in parallel [Zissman, M.A., 1996. Comparison of four approaches to automatic language identification of telephone speech. IEEE Trans. Speech Audio Process....


Comparison of four approaches to automatic language identification of telephone speech

We have compared the performance of four approaches for automatic language identification of speech utterances: Gaussian mixture model (GMM) classification; single-language phone recognition followed by language-dependent, interpolated n-gram language modeling (PRLM); parallel PRLM, which uses multiple single-language phone recognizers, each trained in a different language; and languaged...



Publication date: 1993
